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Abstract 

When  one  works  with  a  system  that  utilizes  inheritance  hierarchies  the  following  problem  often 
arises.  A  new  object  is  introduced  and  it  must  be  integrated  into  a  hierarchy;  under  which  classes 
in  the  hierarchy  should  the  new  object  be  positioned?  In  this  paper,  I  formalise  this  problem  for 
feature-based  default  inheritance  hierarchies.  Since  it  turns  out  to  be  NP-complete,  I  present  an 
approximation  algorithm  for  it.  I  show  that  this  algorithm  is  efficient  and  look  at  some  of  the 
possible  problematic  situations  for  the  algorithm.  Although  more  analysis  and  experimentation  are 
needed,  these  preliminary  results  show  that  the  algorithm  warrants  such  efforts. 
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1  Introduction 


Recent  computational  linguistics  research  into  natural  language  lexicons  has  gone  beyond  simply 
listing  idiosyncratic  information  about  words.  Many  researchers  are  adding  some  sort  of  structure 
to  their  lexicons  to  replace  the  old  practice  of  simply  listing,  in  an  entry  for  a  word,  all  its  properties. 

This  structure  often  takes  the  form  of  feature-based  default  inheritance  hierarchies.  For  example, 
such  hierarchies  are  being  utilized  in  lexicons  for  speech  processing  [Gibbon  and  Bleiching,  1991],  nat¬ 
ural  language  understanding  systems  [Andry  ei  al.,  1992;  Copestake  ei  ai,  1991;  Russell  ei  al.,  1992; 
Krieger  and  Nerbonne,  1991],  lexicography  [Kilgarriif,  1993],  and  theoretical  linguistics  [Flickinger 
and  Nerbonne,  1992;  Gibbon,  1990;  Reinhard,  1990;  Cahill,  1993]. 

The  move  to  structured  lexicons  was  motivated,  in  part,  by  the  fact  that  many  current  linguis¬ 
tic  theories  (e.y..  Head  Phrase  Structure  Grammar  [Pollard  and  Sag,  1987])  relegate  much  of  the 
complexity  of  natural  language  to  the  lexicon.  Thus,  linguistic  generalizations  about  phon<Jogy, 
morphology,  and,  increasingly,  syntax  and  semantics  are  being  made  in  the  lexicon.  Another  reason 
for  the  move  to  structured  lexicons  is  that  the  use  of  machine  readable  dictionaries  and  c(»porar 
based  linguistics  has  made  it  possible  to  construct  large  lexicons  relatively  easily  (e.g.,  [Copestake 
ei  ai.,  1991]).  Large  lexicons  make  issues  like  ease  of  maintenance  and  efficient  use  of  memory  more 
important.  Inheritance  hierarchies  are  one  way  to  address  these  issues. 

Many  of  the  linguistic  generalizations  that  need  to  be  made  in  the  lexicon  involve  a  cluster 
of  properties  and  often  a  hierarchy  of  these  clusters.  For  example,  the  group  of  verbs  often  called 
transitive  verbs  share  the  following  properties:  they  require  a  subject  and  an  object,  the  subject  must 
be  a  noun  phrase,  the  subject  must  have  nominative  case,  the  object  is  usually  a  noun  phrase,  and 
the  object  usually  has  accusative  case.  Feature-based  default  inheritance  hierarchies  use  features  to 
represent  the  properties  of  a  word  and  classes  to  group  features.  Features  are  most  often  some  variant 
of  attribute-value  pairs  (e.g.,  [subj/case,accusative]*  meeming  that  the  subject’s  case  is  accusative). 
Classes  may  stand  in  an  inheritance  relationship:  the  lower  class  (subclass)  receives  the  features  of 
the  higher  class  (superclass).  In  a  default  hierarchy,  the  inheritance  of  any  particular  feature  may  be 
blocked  by  a  feature  local  to  the  subclass.  In  addition,  many  systems  allow  a  class  to  have  multiple 
superclasses. 

An  example  of  a  lexical  hierarchy  is  presented  in  Figure  1.  It  encodes  generalizations  about 
complementation  and  control  features  of  verbs  in  its  classes  and  inheritance  links.  INCOMPLETE 
and  EQUI  are  classes  in  the  hierarchy.  The  pairs  written  directly  below  these  classes  are  some  of 
the  features  that  can  be  inherited  from  them.  Features  inherited  by  a  class  ceui  in  turn  be  inherited 
from  it  (t.e.,  inheritance  is  recursive).  The  lines  between  classes  represent  inheritance  links.  For 
example,  all  the  features  of  INCOMPLETE  are  inherited  by  EQUI — unless  overridden  by  features 
in  CONTROL  or  in  EQUI. 

When  using  such  structured  lexicons,  a  problem  arises  in  deciding  to  which  classes  a  word  belongs. 

This  is  an  instance  of  the  more  general  problem  of  placing  a  new  class  in  the  most  appropriate  place 

in  a  classification  scheme.  In  order  to  avoid  overloading  the  term  ‘classification’,  I  will  call  this  ‘the  _ - 

insertion  problem.’  An  example  of  the  insertion  problem  is  the  task  of  pl8u:ing  the  ditransitive  form _ 

of  give  into  the  hierarchy  of  Figure  1. 

For  example,  suppose  that  this  form  of  give  (e.g.,  she  gave  him  the  ball)  has  the  following  features. 


’  The  first  element  of  this  pair  is  an  atomic  symbol.  Here  and  throughout  this  report,  the  '/’  hu  only  mnemonic  ' 

relevance.  ^  ^ 
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Figure  1:  Complementation  and  control  features  of  verbs  (adapted  from  [Flickinger,  1987]) 


(1)  [subj/cat.N] 

[8ubj/ca8e,nom] 

[8ubj/complete,+] 

(dobycat.N) 

[dobj/ca8e,acc} 

[dobj  /complete, +] 

[iobj/cat,N] 

[iobj/caae,acc] 

[iobj /complete ,+] 

A  solution  to  this  instance  of  the  insertion  problem  would  be  to  place  ^tve  under  the  TRANSITIVE 
and  3-1  classes.  By  doing  so,  ail  of  the  features  listed  above  could  be  inherited  except  [iobj/cat,N]. 
This  feature  would  have  to  be  listed  directly  in  the  entry  for  give  so  that  the  incorrect  default 
inheritance  of  [iobj/cat,P]  from  the  3-1  class  would  be  blocked.  Thus,  only  three  pieces  of  information 
need  to  be  list^  in  the  entry  for  the  ditransitive  verb  give:  it  inherits  from  TRANSITIVE,  it  inherits 
from  3-1,  and  it  has  the  feature  [iobj/cat,N].  Notice  that  if  we  had  inserted  the  entry  under  any  other 
set  of  classes,  more  information  would  have  had  to  be  listed  in  the  entry.  The  main  characteristic  of 
a  good  insertion  is  that  it  minimizes  the  amount  of  information  that  needs  to  be  listed  in  the  entry. 

In  lexicon  research,  the  need  for  an  automated  insertion  system  arises  in  at  least  two  situations. 
First,  when  designing  a  structured  lexicon,  it  is  necessary  to  test  the  generalizations  the  structure 
represents.  One  can  do  this  by  inserting  large  numbers  of  words  and  noting  how  well  the  words  fall 
into  the  classes  of  the  structure.  If  there  are  a  large  number  of  words  that  have  similar  features 
but  do  not  fall  into  a  single  class  then,  perhaps,  a  class  should  be  created  for  them.  Or  if  a  class 
is  seldom  used  by  the  words  for  which  it  was  created,  then  perhaps  it  should  be  modified  in  some 
way  to  better  fit  these  words  (this  situation  often  arises  when  two  or  more  classes  overlap  in  the 


2 


generalisations  they  represent).  Second,  when  using  a  finished  hierarchy  or  a  set  of  hierarchies 
to  build  a  large  lexicon,  one  must  transfer  large  numbers  of  lexical  items  from  raw  data  files  or 
some  other  classification  scheme  to  the  hierarchies.  For  example,  in  the  ELWIS  project  [Feldweg 
and  Storrer,  1992]  a  relational  database  is  used  to  store  application-independent  information  which 
has  been  extracted  from  corpora  and  a  machine  readable  dictionary.  If  the  target  application  uses 
a  feature-based  default  hierarchy,  then  an  insertion  system  is  needed  to  move  the  data  from  the 
relational  database  to  the  hierarchy.  Figure  2  illustrates  this  situation.  The  first  input,  marked  by 
the  dotted  line,  to  a  general  insertion  system  is  the  specific  hierarchy  into  which  the  items  will  be 
inserted.  After  this  hierarchy  has  been  processed,  the  items  themselves  are  taken  from  the  database 
and  inserted  into  the  inheritance  hierarchy. 


Figure  2:  Transferring  objects  from  a  database  to  an  inheritance  hierarchy 

In  the  following  sections  of  this  report,  I  will  describe  my  approach  to  defining  and  solving  the 
insertion  problem  for  feature-based  default  inheritance  hierarchies.  First,  1  give  an  informal  char¬ 
acterization  of  the  problem  and  argue  that  this  characterization  captures  the  relevant  aspects  of 
the  problem.  Second,  I  formalize  this  characterization  and  in  appendix  A  show  that  the  problem  is 
NP-complete.  This  result  means  that  it  is  unlikely  that  a  computationally  tractable  algorithm  for 
its  solution  will  be  found.  Next,  I  describe  an  approximation  algorithm  for  the  problem  and  through 
complexity  analysis  and  discussion  of  preliminary  experimentation,  I  argue  that  this  algorithm  pro¬ 
duces  reasonable  results  in  an  acceptable  amount  of  time  and  space.  This  algorithm  was  modified 
slightly  so  that  it  would  produce  insertions  for  hierarchies  encoded  in  the  DATR  formalism.  This 
modified  algorithm  is  part  of  a  prototype  of  the  INSYST  system  [Light  et  ai,  1992]  which  inserts 
lexical  items  taken  from  the  ELWIS  relational  database  [Feldweg  and  Storrer,  1992]  into  feature- 
based  default  hierarchies  built  in  the  DATR  formalism.  Appendix  B  contains  example  runs  of  the 
INSYST  system. 

Although  this  report  focuses  on  the  use  of  default  inheritance  hierarchies  in  natural  language 
lexicon  research,  the  results  described  here  are  relevant  to  any  system  that  uses  default  inheritance 
hierarchies  and  requires  automated  insertion. 


2  Informal  Characterization  of  the  Problem 

A  good  insertion  places  an  object  under  classes  from  which  it  can  inherit  most  of  its  features.  In 
addition,  it  should  use  as  few  classes  as  possible  to  achieve  this  goal.  These  two  requirements 
are  aspects  of  a  basic  principle  of  classification  schemes:  reduce  redundancy.  A  coroll2try  of  this 
principle  is  that  a  good  insertion  should  minimize  the  amount  of  information  stored  in  the  entry 
for  the  object  being  inserted  and  thereby  maximize  the  use  of  the  information  contained  in  the 
inheritance  relationships.  Three  types  of  information  are  stored  in  an  entry:  its  superclass(es),  the 
features  that  are  not  inherited  from  any  superclass,  and  the  features  that  are  needed  to  block  an 
incorrect  inheritance.  Thus,  an  optimal  set  of  superclasses  for  the  object  being  inserted  has  the 
smallest  possible  value  for  the  following  sum. 
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(2)  nombet  of  tuperclMses  4-  number  of  object  features  that  are  not  in  these  superclaases  +  the 
number  object  features  that  must  be  listed  to  block  incorrect  inheritance 


I  will  refer  to  this  sum  as  the  cost  of  the  insertion  or  solution. 

At  first  glance,  one  might  think  that  one  easy  way  to  find  an  optimal  solution  is  to  start  at  the 
roots  of  the  hierarchies  that  make  up  a  structured  lexicon  and  simply  walk  down  these  hierarchies, 
pruning  off  the  branches  below  a  class  that  contains  a  feature  that  the  object  does  not  have  (since 
all  clauses  below  this  class  will  also  have  this  feature).  This  i^jproach  will  work  for  strict  inheritance 
hierarchies.  Strict  hierarchies  do  not  allow  default  inheritance — all  the  subclasses  below  a  given  class 
have  to  inherit  all  its  features.  The  algorithm  outlined  above  uses  this  characteristic  to  cut  down  the 
search  space  for  superclasses.  However,  default  hieruchies  do  not  have  this  characteristic:  if  a  class 
A  inherits  from  a  class  B,  the  set  of  features  associated  with  A  might  not  be  a  superset  of  the  set 
associated  with  B.  Thus,  an  insertion  algorithm  for  default  hierarchies  cannot  ignore  A  as  a  possible 
superclass  for  an  object  simply  because  it  has  decided  B  is  unsuitable;  A  might  be  an  exception  to  B 
in  just  the  right  ways  (see  Figure  3).  Because  of  these  considerations,  for  the  purposes  of  insertion, 


B 

(al,vl] 

(a2.v2] 

[a3,v3j 


A 

[al,v5] 


features  of  the 
object  to  be  inserted 
{[al,v5][s3,v2][a3,v3]} 


Figure  3:  Insertion  in  default  hierarchies 


I  make  the  folloyring  claim. 

(3)  Each  class  in  a  default  hierarchy  should  be  viewed  as  the  set  of  features  that  can  be  inherited 
from  this  class. 

(3)  is  a  central  claim  of  the  approach  to  insertion  taken  in  this  report.  (After  1  explain  exactly  what 
is  meant  by  (3),  I  will  defend  the  claim.)  It  amounts  to  compiling  out  the  inheritance  relationships 
so  that  a  hierarchy  becomes  a  set  of  sets  of  features.*  ‘Compiling  out’  the  inheritance  relationships  is 
a  process  of  pushing  features  down  the  inheritance  links  to  the  classes  below  so  that  all  the  features 
of  a  class  are  explicitly  listed  in  the  data  structure  for  the  class.  For  example,  compiling  out  the 
hierarchy  in  Figure  1  would  produce  the  set  of  sets  listed  in  (4).  The  first  set  corresponds  to  the 
COMPLEMENTATION  class,  the  second  to  INCOMPLETE,  the  third  to  COMPLETE,  the  fourth 
to  TRANSITIVE,  and  the  fifth  to  3-1. 


*  AdtUtional  infonnation  such  as  the  relative  height  from  whidi  a  feature  is  inherited  can  be  computed  during  this 
compilation  proceee  and  attached  to  a  feature  via  a  weight.  These  weights  could  then  be  used  when  computing  the 
cost  of  a  solution.  Such  weights  would  be  one  way  to  add  bade  a  small  amount  of  hierarchical  information  without 
making  (3)  vacuous. 
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(4)  {{} 

{[complete,-] 
[subj/cat,N] 
[subj/case,nom] 
[subj/complete.-f]} 
{[complete, +]} 
{[complete,-] 
[8ubj/cat,N] 
[subj/caee,nom] 
[subj/complete,-|-] 
[dobycat,N] 
[dobj/case,acc] 
[dobj/complete,-f-] } 
{[complete,-] 
[8ubj/cat,N] 
[8ubj/case,nom] 
[8ubj/complete,-H] 
[iobj/cat,N] 
[iobj/ca8e,acc] 
[iobj/complete,-t-]} 
.... 


Now  that  we  have  a  better  idea  of  what  is  meant  by  (3),  I  will  argue  that  (3)  should  be  adopted 
when  dealing  with  the  problem  of  insertion.  Intuitively,  it  might  seem  that  reducing  a  hierarchy  to  a 
flat  set  of  sets  defeats  the  purpose  of  the  hierarchy.  This  intuition  exists  because  most  hierarchies  are 
built  so  that  lower  classes  represent  sub-generalizations  of  the  generalizations  represented  by  higher 
classes.  These  relations  between  generalizations  are  used  by  the  hierarchy  to  reduce  redundancy. 
By  flattening  the  hierarchy,  one  loses  these  relationships  and  thus  redundancy  increases,  which  is 
usually  bad.  This  intuition  is  correct  when  one  considers  storage  and  maintenance  of  information. 
More  succinctly,  if  one  uses  a  set  of  sets  instead  of  a  hierarchy  to  represent  information,  the  ability 
to  represent  higher  order  generalizations  is  lost. 

However,  when  one  is  concerned  with  insertion,  the  loss  of  higher  order  generalizations  is  irrel¬ 
evant.  It  is  irrelevant  because  if  one  accepts  that  a  good  insertion  minimizes  the  space  needed  to 
store  the  properties  of  the  object  being  inserted,  then,  with  respect  to  insertion,  a  class  is  simply  a 
chance  to  save  space  by  storing  one  class  name  instead  of  a  number  of  features.  Thus,  what  features 
can  be  inherited  from  a  class  is  the  only  characteristic  of  a  class  relevant  to  insertion.  In  order  to 
compute  which  features  can  be  inherited  from  any  given  class,  one  has  to  look  at  the  entry  for  the 
class  and  its  superclasses  and  in  turn,  their  superclasses  etc.  At  each  level,  the  features  of  the  classes 
above  are  passed  down  to  the  classes  below.  This  is  the  processes  of  compiling  out  the  hierarchy. 

In  response  to  this  line  of  argumentation,  one  might  counter  that  the  hierarchy  can  be  used 
to  reduce  the  search  space  of  possible  superclasses.  As  mentioned  above,  this  is  true  for  strict 
hierarchies.  However,  as  illustrated  in  Figure  3,  this  is  not  the  case  with  default  hierarchies.  As 
a  result,  the  inheritance  links  between  classes  in  a  default  hierarchy  lose  their  usefulness  after  it 
has  been  determined  which  features  can  be  inherited  from  each  class.  Thus,  when  dealing  with  the 
insertion  problem  in  default  hierarchies,  one  should  think  of  the  hierarchy  as  a  set  of  sets,  each  of 
which  corresponds  to  a  possible  superclass  of  the  object  being  inserted.  The  insertion  problem,  then, 
amounts  to  picking  an  optimal  subset  of  this  set  of  sets;  the  definition  of  optimal  remains  the  same. 


5 


3  Formal  Definition 


In  thin  section,  I  will  formalize  the  intuitive  characterization  of  the  previous  section.  The  purpose 
of  this  formalization  is  to  provide  a  basis  for  the  analysis  of  the  complexity  of  the  problem  and 
ultimately  the  development  of  an  algorithm  to  produce  solutions  to  the  problem. 

The  first  task  is  to  give  a  formal  characterization  of  the  features  we  have  been  discussing  infor¬ 
mally  in  the  previous  section.  Many  different  feature  systems  are  used  by  linguists  and  computa¬ 
tional  linguists.  Two  feature  systems  in  use  today  are  typed  feature  structure  [Pollard  and  Sag,  1987; 
Carpenter,  1992]  and  the  features  used  in  DATR  [Evans  and  Gazdar,  1990].  These  systems  grew  out 
of  previous  work  in  GPSG  [Gazdar  et  ai,  1985]  and  categories  in  general  [Gazdar  et  a/.,  1988]  which 
in  turn  had  roots  in  feature  systems  introduced  in  generative  syntax  [Chomsky,  1965]  and  generative 
phonology  [Chomsky  and  Halle,  1968].  Although  I  will  work  with  a  specific  feature  system,  which  I 
will  introduce  shortly,  the  algorithms  and  proofs  that  I  will  present  here,  generalize  to  any  feature 
system  with  the  following  property:  whether  two  features  clash  can  be  computed  in  time  polynomial 
in  the  size  of  the  features.^  Intuitively,  two  features  clash  if  they  are  inconsistent.  For  example, 
the  feature  [fly,-(-]  clashes  with  the  feature  [fly,-].  Different  feature  systems  implement  this  notion 
in  slightly  different  ways.  The  two  feature  systems  just  mentioned  have  this  property.  In  fact,  for 
typed  feature  structures,  an  algorithm  for  unification  exists  that  is  just  over  linear  in  the  size  of  the 
features  [Martelli  and  Montanari,  1982].  As  we  will  see  in  a  moment,  unification  and  checking  for 
clashes  are  closely  related. 

In  the  remainder  of  this  paper,  I  will  assume  the  following  feature  system.  A  feature  is  a  pair 
of  atomic  symbols  (e.g.  [a,v]).  The  first  element  is  taken  from  a  set  of  attributes  (ATTRIBUTES) 
and  the  second  from  a  set  of  values  (VALUES).  For  generality,  I  will  assume  that  these  sets  are 
infinite.  The  set  VALUES  includes  a  symbol  ?  which  intuitively  specifies  that  the  corresponding 
attribute  is  undefined  or  unknown  for  the  object  with  the  feature;  more  on  this  later.  Two  features 
dash  if  their  attributes  are  the  same  but  their  values  ue  different;  this  definition  also  holds  for  the 
value  ?.  Testing  if  two  features  clash  can  be  done  in  constant  time.  I  will  use  clash  to  denote  a 
binary  function  on  sets  of  features  that  produces  the  subset  of  features  from  the  first  set  that  aire  in 
conflict  with  a  feature  of  the  second  (see  Figure  4).  I  assume  that  the  sets  of  features  are  internally 
consistent,  t.e.,  each  set  contains  at  most  one  feature  per  attribute.  c/ash(C,7>)  can  be  computed 
for  the  finite  sets  C  and  V  in  0(|C||Z>|)  time  by  simply  testing  every  pair  formed  from  an  element 
from  C  and  an  element  of  V  for  a  clash.  (An  average  time  complexity  of  0(|C|)  can  be  achieved 
by  hashing  the  set  V  using  attributes  as  keys.)  If  one  views  a  typed  feature  structure  as  a  set  of 
features,  then  the  clash  function  amounts  to  listing  all  the  features  that  do  not  unify. 

With  this  feature  system  in  hand,  we  can  move  on  to  the  problem  of  insertion.  Remember  that 
classes  are  the  ‘nodes’  of  the  hierarchy,  the  superclasses  of  a  class  A  are  the  classes  from  which  A 
inherits,  and  the  subclasses  of  A  are  the  classes  that  inherit  from  A.  An  object  class  is  a  leaf  of  the 
hierarchy  that  represents  a  single  object.  In  this  section,  any  mention  of  an  object  class  refers  to 
the  object  class  being  constructed  for  the  object  being  inserted. 

As  mentioned  above,  I  am  concerned  with  feature-based  default  multiple  inheritance  hierarchies; 
default  in  that  all  inheritance  relationships  are  defeasible  and  multiple  in  that  classes  can  inherit  from 
more  that  one  superclass.  I  assume  that  the  hierarchies  are  unambiguous;  for  any  given  attribute, 
a  node  only  inherits  one  value  for  it.  I  will  discuss  the  issue  of  ambiguity  with  respect  to  insertion 
in  section  5.  I  also  assume,  for  expository  reasons,  that  the  feature  sets  inheritable  from  the  classes 
are  unique:  that  a  given  set  of  features  cannot  be  inherited  from  more  than  one  class  in  a  hierarchy. 
The  inheritance  hierarchy  in  Figure  1  with  the  object  node  for  give  under  TRANSITIVE  and  3-1  is 
an  example  of  a  feature-based  default  multiple  inheritance  hierarchy.  The  OBJ-RAISING  class  and 


^The  features  1  will  use  in  this  report  are  all  of  the  same  constant  size.  However,  in  some  feature  systems,  features 
mi^  have  complex  attributes  and  values  and  thus  may  vary  in  size. 


{[al,vl] 

B  =  {[al,v5] 

[a2,v2] 

[a2,v2] 

[»3,v3] 

[a3,v20] 

[a4,v4]} 

[a7,v7] 

[a9,vl2]} 

clash{A,B)  =  {[aljvl] 
[a3,v3]} 

Figure  4:  An  example  of  the  clash  function 


the  class  for  give  both  exemplify  the  multiple  inheritance  characteristic.  The  feature  [dobj/cat,N] 
in  the  give  class  exemplifies  the  defeasible  nature  of  the  inheritance  relationship.^ 

Further,  I  am  concerned  with  cautious  insertion;  only  the  known  features  of  the  object  being 
inserted  should  be  inherited.  An  adventurous  insertion  would  allow  an  object  class  to  inherit  extra 
features  from  its  superclasses.  I  view  adventurous  insertion  as  an  extension  of  cautious  insertion.^  In 
order  to  deal  with  the  problem  of  inheriting  extra  features  from  superclasses,  I  will  assume  that  the 
symbol  ?,  when  used  as  a  value  in  a  feature,  means  that  the  value  is  unknown  for  the  object  or  that 
the  attribute  is  not  appropriate  for  that  object.  I  will  cadi  such  features  ’-features.  Remember  that 
[a,?],  like  any  other  feature,  clashes  with  features  that  have  the  same  attribute  but  a  different  value. 
Only  object  classes  are  adlowed  to  contadn  ?-features.  Figure  5  illustrates  both  forms  of  insertion 
and  the  use  of  the  ?  value.  In  the  case  of  adventurous  insertion.  Object  1  would  have  the  feature 
[a3,v3]  whereas  with  cautious  insertion,  its  inheritance  would  be  blocked  by  [a3,?]. 

In  the  previous  section,  I  defined  the  insertion  problem  so  that  solutions  could  be  incomplete: 
not  all  the  features  of  the  object  being  inserted  need  be  inherited.  Of  course,  a  solution  pays  a  price 
for  forcing  a  feature  to  be  listed  in  the  object  class.  The  second  term  of  the  sum  in  (2)  specifies  the 
price  paid.  Because  of  the  possible  existence  of  clashes  (the  third  term),  an  optimal  solution  may  be 
incomplete.  Such  incomplete  insertions  complicate  the  reduction  needed  to  prove  NP-completeness 
and  the  statement  of  the  insertion  algorithm.  To  circumvent  this  difficulty,  I  will  assume  that  all 
hierarchies  contain  a  singleton  class  for  each  non-?-feature  of  the  object  being  inserted.  (I  will  leave 
these  sets  out  of  the  figures  of  this  report.)  A  singleton  class  is  a  class  from  which  exactly  one 
feature  can  be  inherited  and  which  has  no  superclasses  and  no  subclasses  other  than  object  classes. 
Listing  a  singleton  class  as  a  superclass  in  an  object  class  is  equivalent  to  simply  listing  the  feature 
of  the  singleton  class.  This  assumption  allows  me  to  require  complete  insertions  while  still  being 
able  to  represent  the  notion  of  incomplete  insertion  since  luiy  insertion  that  contains  a  singleton  set 
represents  an  incomplete  insertion. 

An  instance  of  the  insertion  problem  (IN)  is  a  pair  made  up  of  a  set  and  a  set  AT  as  defined 
below.  Note  that  only  a  finite  number  of  features  is  needed  to  specify  ^  and  that  there  is  a  one-to-one 
mapping  between  the  sets  in  Af  and  the  classes  of  the  hierarchy. 


*The  blocking  of  a  default  inheritance  can  occur  between  any  two  rlamiee  in  the  hierarchy,  one  of  the  claeeee  need 
not  be  an  object  class. 

^The  algorithm  developed  in  the  following  section  would  require  only  slight  modification  to  produce  adventurous 
insertions. 
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Adventurous 

insertion 


Cautious 

insertion 


Object  1 


Object  1 
[•3.?] 


the  original  features  of  Object  1 

{(*l,vl]  [a2,v2]} 

Figure  5:  Adventurous  vs.  cautious  insertion 


T  contains  the  features  of  the  object  being  inserted.  T  must  be  complete  in  the  sense  that  for  all 
attributes  in  ATTRIBUTES  there  exists  a  feature  in  T  that  contains  a  value  for  that  attribute. 
However,  T  can  only  have  a  finite  number  of  features  that  contain  values  other  than  ?.  All 
the  other  features  are  ?-feature8. 

H  is  a  set  of  sets  of  features.  must  be  finite  and  each  of  its  elements  must  be  finite.  These  sets 
cannot  contain  ?-features. 

If  we  return  to  our  discussion  of  the  hierarchy  in  Figure  1,  we  see  that  the  set  of  features  of 
ditransitive  ^'ve,  listed  in  (1),  is  T  for  this  hieruchy.  The  set  of  sets  of  features  in  (4)  that  resulted 
firom  compiling  out  the  hierarchy,  plus  singleton  sets  for  the  features  of  T,  is  A/*. 

A  solution  is  a  set  ^  C  at.  V  represents  the  superclasses  for  the  object  whose  features  are  T. 
Every  non-?-feature  in  T  must  be  an  element  of  either  a  set  in  T’  or  cla8h{T,\JV).^  The  features  in 
this  latter  set  have  to  be  listed  explicitly  for  the  object  in  order  to  override  incorrect  inheritance.  An 
optimal  solution  minimizes  the  number  of  superclasses  together  with  the  number  of  clashes  between 
these  superclasses  and  ^  (see  (5)^). 

(5)  |^|+|c/osA(A^,U^)l 

Finally,  the  insertion  problem,  IN,  is  the  problem  of  finding  optimal  solutions  for  instances  as  defined 
above. 

4  An  approximation  algorithm  for  IN 

An  algorithm  for  IN  would  take,  as  input,  instances  of  IN  and  produce,  as  output,  optimal  solutions. 
In  appendix  A,  I  show  that  the  decision  problem  for  IN,  IN^,  is  NP-coraplete.  This  result  implies 

where  4  ie  a  set  of  sets  i*  equal  to  U  v>  u...u<^n. 

^Notice  that  the  first  two  temu  of  (3)  have  been  collapsed  into  the  first  term  of  (5).  This  is  due  to  the  presence 
of  dn^ton  sets. 
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that  it  is  highly  unlikely  that  a  computationally  tractable  algorithm  for  IN  exists.  Instead,  the  best 
one  should  hope  for  is  a  computationally  tractable  algorithm  that  produces  near  optimal  solutions. 

Part  of  the  proof  in  appendix  A  is  a  relatively  straightforward  reduction  of  the  NP-complete  set 
covering  (SC)  decision  problem  to  IN^.^  SC  is  defined  as  follows 

“...the  set-covering  problem  consists  of  a  finite  set  T  and  a  family  M  of  subsets  of  !F, 
such  that  every  element  of  T  belongs  to  at  least  one  subset  of  A/*... We  say  that  a  subset 
S  M  covers  its  elements.  The  problem  is  to  find  a  minimum-size  subset  V  cM  whose 
members  cover  all  of  T”  ([Cormen  el  at.,  1990],  p.974, 1  have  substituted  vuiable  names 
analogous  to  the  ones  I  have  used  for  IN). 

Figure  6  illustrates  the  problem  and  a  solution.  The  following  situation  would  be  a  real  world 
application  of  SC:  a  student  wishes  to  schedule  her  four  years  of  college  such  that  she  takes  the 
smallest  number  of  classes  but  still  satisfies  all  the  requirements  for  a  computer  science  degree. 


f  N 

1.2, 3, 5, 7 

V _ J 


Figure  6:  Set  covering 

IN  can  be  viewed  as  extending  SC  in  two  ways.  First,  it  substitutes  features  for  integers.  Fea¬ 
tures  introduce  the  possibility  of  clashes  between  elements.  This  makes  the  process  of  choosing  V 
more  complex  since  one  must  take  into  account  features  clashed  with  as  well  as  features  covered. 
Second,  intuitively,  IN  loosens  the  restriction  that  all  non-?-elements  in  T  must  be  covered.  How¬ 
ever,  in  the  formalization  of  the  insertion  problem,  I  finessed  this  problem  by  requiring  AT  to  contain 

*SET  COVERING  is  also  know  as  MINIMUM  COVER.  It  was  oripnally  proven  to  be  NP-complete  in  [Karp, 
1972]. 
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aingletoD  sets  for  all  non-?-element8  in  T.  Thus,  the  formalisation  technically  requires  compkte 
coverage  but  retains  the  ability  to  represent  incomplete  covers  since  listing  a  singleton  set  in  the 
object  class  is  equivalent  to  listing  an  uncovered  feature. 

A  polynomial  time  approximation  algorithm  exists  for  SC  and  the  solutions  it  produces  are 
guaranteed  to  be  close  to  the  optimal  solution.  More  formally,  the  ratio  of  the  size  of  the  approximate 
sclution  produced  by  the  algorithm  to  the  optimal  solution  is  bounded  by  the  natural  logarithm  of 
the  size  of  the  set  being  covered  [Lovasz,  1975;  Johnson,  1974;  Chvatal,  1979]; 

(6)  jT^approrl/lT^opttmall  ^  "i"  !• 

In  other  words,  as  the  size  of  the  problem  increases,  the  size  of  the  approximation  will  grow  only 
logarithmically  faster  than  the  size  of  the  optimal  solution. 

The  approximation  algorithm  is  greedy:  at  any  given  point,  it  picks  the  sul»et  that  can  cover 
the  most  features  at  that  time  and  it  never  goes  back  on  this  choice.  Greedy  algorithms  tend 
to  be  fast  but,  since  they  cannot  backtrack,  can  make  local  choices  that  prevent  globally  optimal 
solutions.  Because  of  the  similarities  between  SC  and  IN,  it  seems  likely  that  a  greedy  approximation 
algorithm  will  produce  good  solutions  for  IN.  The  algorithm  Greedy- II  (listed  below)  is  efficient 
and  has  produced  good  solutions  in  the  small  number  of  experiments  I  have  performed  so  far  (see 
appendix  B).  However,  I  have  not  yet  been  able  to  prove  a  ratio  bound  for  it. 

6reedy-ll 

1.  ^temp  {[a,v]l[a,v]e^Av^  ?} 

2.  P  :=  0 

3  •  ^ elaih  '  ~  0 

4.  while  Fttmp  ^  0 

6.  select  that  maximizes  |5nJ^femp|  -  lclos/»(J',5)  - 

6 .  ttmp  ’  *  ^ ttmp  —  (5  U  clash{^ ,  5)) 

7.  ^ela$h  I®  ^eJajfc  U c/fls/l(^ ,S) 

8.  V  :=  7»U{5} 

9 .  return 

10.  list  V  and  in  the  data  structure  lor  the  object 


The  algorithm  works  as  follows.  It  takes  as  input  the  set  of  features  of  the  object  being  inserted, 
and  a  set  of  sets  of  features,  Af,  that  represents  the  classes  of  the  hierarchies.  It  produces  as 
output,  a  list  of  superclasses  and  a  list  of  features  that  must  be  listed  locally  for  the  object.  During 
each  iteration  of  its  main  loop  (lines  4-9),  it  picks  the  most  suitable  superclass  (line  5).  The 
features  from  T  that  can  be  inherited  from  the  new  superclass  S  combined  with  those  that  must 
be  listed  to  block  incorrect  inheritance  are  subtracted  from  T .  This  loop  is  repeated  until  there 
is  no  class  in  Al  from  which  more  features  can  be  inherited  than  must  be  listed  to  block  incorrect 
inheritance.^  The  algorithm  halts  and  returns  the  superclasses  along  with  features  that  must  be 
listed  to  block  incorrect  inheritance.  To  pick  the  most  suitable  superclass,  a  subroutine  is  called  for 
each  class  remaining  in  N  that  computes  the  difference  between  the  number  of  features  remaining 
in  T  that  can  be  inherited  from  the  class  and  the  number  of  new  features  that  would  be  incorrectly 
inherited.^® 


*The  diflerence  being  maximized  in  liB*  S  will  have  a  maximum  value  of  1  or  greater  because  of  the  singleton  sets 
in  Al.  The  ain^eton  sets  that  cover  an  element  of  Tump  will  not  produce  any  clashes  and  thus  will  have  a  value  of 
1  for  the  difference.  When  these  sets  are  exbaiuted,  Tump  must  be  empty  and  thus  the  loop  wiU  return.  Remember 
that  these  singleton  sets  represent  features  that  have  remained  uncovered  and  must  be  listed  individually. 

Another  possibility  is  to  use  the  ratio  of  these  two  counts  instead  of  the  difference. 
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At  this  point,  it  may  be  helpful  to  step  through  an  example  insertion.  Consider  the  hierarchy 
illustrated  in  Figure  7.  It  is  a  fragment  of  a  hierarchy  used  to  classify  German  noun  compounds 
with  respect  to  their  stress  patterns  [Gibbon  and  Bleiching,  1991].  The  compiled-out  form  of  the 
hierarchy,  with  singleton  sets,  is  listed  in  (7)  and  the  features  of  the  object  to  be  inserted,  the  noun 
compound  Burgermeiater,  are  listed  in  (8). 


NOUN 

(syii/cst,noun} 
[moiph/cst^tMt] 
[proa  cat^iill 


NOUN-COMPOUND 

(morph /cat  .compound] 
(pros/cat4eft} 


NOUN.COMPOUND.MARKED 

[pros/cat, right] 

Figure  7:  A  hierarchy  of  features  relevant  to  compound  stress  in  German  (adapted  from  [Gibbon 
and  Bleiching,  1991]) 


(7)  {{[8yn/cat,noun] 

[morph/cat,root] 

[pros/cat,nil]} 

{[syn/cat,noun] 

[morph/cat,compound] 

[pros/cat,left]} 

{[syn/cat,noun] 

[morph/cat, compound] 

[pros/cat, right]} 

{ [determinans,Buerger] } 

{ [determinatum  ,Meister] } 

{[syn/cat,noun]} 

{ [morph /cat  ,compound] } 

{[pros/cat, right]}} 

(8)  {[determinans,Buerger] 

[determinatum,Meister] 

[syn/cat,noun] 

[morph /cat  .compound] 

[pros/cat, right]} 

The  input  to  Gre®dy-II  is  the  set  of  features  being  inserted,  listed,  in  part^^,  in  (8)  and  the 
set  of  sets  of  features,  A*,  listed  in  (7).  The  first  three  steps  of  the  algorithm  initialize  the  variables 


^‘Remember  that  ^  contains  a  potentially  infinite  number  of  ?-feature8.  These  features  are  not  listed  in  (8). 
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to  be  used  in  the  main  loop:  Tump  is  set  to  the  noii-?-feature8  of  T  (i.e.  (8)),  and  V  and  Teiaih^^ 
to  nil. 


After  these  variables  have  been  initialized,  the  main  loop  begins  and  will  continue  until  there 
are  no  more  features  in  Tump-  In  our  example  this  will  only  take  three  iterations.  During  the  first 
iteration  the  nodes  of  the  hierarchy  get  the  following  scores  while  computing  line  6. 

NOUN:  -1 

NOUN.COMPOUND:  1 

NOUN.COMPOUND-MARKED:  3 

The  singleton  sets  all  get  scores  of  1.  NOUN  receives  its  score  of  -1  because  it  covers  one  feature, 
[syn/cat,noun],  but  clashes  with  two  others:  [morph/cat, compound]  and  [pros/cat, right].  Since 
NOUN.COMPOUND-MARKED  has  the  highest  score,  line  5  sets  S  to  it. 

The  next  set  of  lines  (6-8)  compute  the  ramifications  of  this  new  superclass.  In  line  6, 
[syn/cat,noun],  [morph/cat .compound],  and  [pros/cat,right]  are  removed  from  Tump-  Notice  that 
because  of  this  subtraction,  NOUN.COMPOUND-MARKED  will  receive  the  score  0  in  future  it¬ 
erations  and  therefore  will  not  be  chosen  again.  Line  7  adds  nothing  to  Teiath  since  the  fea¬ 
tures  of  NOUN.COMPOUND-MARKED  do  not  clash  with  any  features  in  T.  Finally,  in  lino  8, 
NOUN.COMPOUNDJ^ARKED  is  added  to  P. 

Since  Tump  still  contains  [determinans, Buerger]  and  [determinatum,Meister]  the  loop  continues. 
In  the  second  iteration,  the  following  scores  are  produced  in  line  B. 

NOUN:  0 

NOUN.COMPOUND.  0 

The  singleton  sets  for  the  features  still  in  Tump  receive  scores  of  1  while  the  singletons  for  the 
features  already  covered  get  scores  of  0.  Thus  a  singleton  set  is  the  winner.  The  choice  between 
them  is  arbitrary.  Let  us  assume  the  singleton  set  for  [determinans,Buerger]  is  chosen:  this  feature 
is  removed  from  Tump,  Tdash  remains  unchwged,  euid  the  singleton  is  added  to  V.  In  the  next 
iteration  the  same  thing  occurs  for  the  singleton  set  corresponding  to  the  final  feature  in  Tttmp 
[determinatum,Meister].  After  this  iteration  Tump  is  empty  and  the  loop  terminates. 

The  final  line  of  the  algorithm  produces  the  output  for  the  inserted  object.  In  this  case  only  the 
first  conjunct  applies  since  no  clashes  occurred.  The  output  is  listed  below. 


{[syn/cat,noun] 
[morph/cat, compound] 
[pro6/cat,right]} 

{ [determinans,Buerger] } 

{ [determinatum,Meister] } 


The  first  set  corresponds  to  the  NOUN.COMPOUND-MARKED  class  and  the  other  two  correspond 
to  singleton  sets.  Remember  that  these  singleton  sets  represent  uncovered  features. 


i*  used  to  keep  track  of  the  features  that  the  classes  in  V  clash  with.  This  set  of  fesitures  is  necessary  to 
select  the  new  superclass  since  the  algorithm  does  not  want  to  penalise  a  class  for  clashing  with  a  feature  that  has 
already  been  dashed  with  by  a  member  of  V. 
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5  Analysis  of  the  Algorithm 

As  mentioned  above,  Greedy-II  is  a  greedy  algorithm.  At  choice  points,  greedy  algorithms  make 
a  locally-optimal  choice  and  never  backtrack.  As  should  be  expected,  Gready-II  is  fast  but  pro¬ 
duces  sub-optimal  solutions  in  some  situations.  In  this  section,  I  will  first  discuss  how  efficiently 
Grssdy-II  produces  solutions.  Then  I  will  discuss  the  quality  of  the  approximate  solutions  pro¬ 
duced.  Greedy-Il  runs  in  time  polynomial  in  the  size  of  the  encoding  of  the  input  pair  (TJJ),  more 
specifically,  0(min{|.?Vion-?|i  |-Ar|}l-?>i<m-?||A/’|)  where  Tnon-7  represents  the  set  of  non-?  features 
in  To  see  this,  consider  the  while  loop  (lines  4-9).  It  can  have,  at  most,  |.V|} 

iterations.  This  is  because,  in  the  worst  case,  either  all  the  features  in  Tnon-’>.  will  be  covered  one 
by  one  or  all  the  classes  in  M  will  have  been  used.  Each  iteration  has  a  maximum  duration  of 
0(|^non-?||<A/^|)-  This  is  because,  in  order  to  choose  the  winning  node  S  in  line  6,  one  has  to 
compare  every  feature  in  .^non-?  to  every  clara  in  M  to  see  if  M  contains  the  feature  or  clashes  with 
it.  (This  comparison  can  be  done  in  constant  time  since  the  features  in  a  set  in  AT  can  be  stored 
in  an  array  where  the  attributes  serve  as  indices  into  the  array.  An  array  can  be  used  since  each 
set  in  M  is  finite  and  only  has  one  feature  per  attribute.)  Thus,  the  time  complexity  of  the  overall 
algorithm  is  0(min{|/‘„o„_?|,  |A/’|}|.^non-?llA/’|).  The  space  complexity  is  also  polynomial  in  the 
size  of  the  input:  0{Tnax{\Tnon-?\,  |A/’||5ma*|))  where  Smax  is  the  largest  element  of  AT.  To  see  this 
notice  that  the  algorithm  must  keep  track  of  what  has  happened  to  a  feature  mTnm-?  &nd  it  must 
have  an  array  of  size  [iSmaf  I  to  store  the  features  of  each  class  in  A/*.  This  space  could  be  reduced  in 
practice  by  using  hash  tables  instead  of  arrays  to  store  the  features  for  a  class  in 

As  for  the  relationship  between  the  cost  of  a  solution  ].  jduced  by  Greedy-II  and  the  cost 
of  a  corresponding  optimal  solution,  at  this  point  in  time,  I  can  offer  neither  a  theoretical  proof 
of  a  logarithmic  bound  nor  extensive  experiment^  results  showing  good  performance  (however,  a 
prototype  implementation  has  performed  well  in  small  pilot  studies  (see  appendix  B)). 

Instead,  I  will  discuss  the  basic  situation  that  causes  sub-optimal  results  to  be  produced  by  greedy 
algorithms  and  two  standard  problems  for  default  inheritance  hierarchies  from  [Touretzky,  1986]: 
redundant  links  and  nixon  diamonds.  A  system  that  utilizes  default  inheritance  hierarchies  must 
address  these  problems  since,  if  left  unattended,  they  can  produce  inconsistencies  and/or  unexpected 
behavior  in  the  system.  I  will  discuss  these  problems  in  the  context  of  insertion. 


A 

B 

c 

[al,vl] 

(al.vl) 

[a3,v3) 

[a2,v2] 

(a2,v2] 

[a4,v4] 

[a3,v3] 

[aS.vS] 

[a6,v6] 

(a4,v4j 

^  —  {[sl.vl]  [a2,v2]  [a3,v3]  [a4,v4]  [s5,v5]  [a6,v6]} 

Figure  8:  Problematic  IN  instance  for  Greedy-II 

An  example  of  the  basic  sub-optimal  solution  producing  situation  is  illustrated  in  Figure  8.  The 
problem  here  is  that  class  A  is  seductive.  It  covers  a  large  number  of  features  from  T,  more  than  B 
or  C.  However,  if  A  is  chosen  it  is  still  necessary  to  pick  B  and  C  or  the  two  appropriate  singleton 
sets  to  cover  the  features  not  covered  by  A:  [a5,v5]  and  [a6,v6].  If  an  algorithm  can  withstand  the 
temptation  of  class  A  and  instead  go  with  B  or  C,  this  algorithm  is  rewarded  by  simply  choosing  the 
other  class  (either  B  or  C)  to  cover  the  rest  of  .F.  The  solution  set  {B,C}  is  smaller  than  {A,B,C} 
and  since  both  cover  all  of  F  and  do  not  produce  clashes,  {B,C}  is  superior.  However,  Greedy-II 
falls  for  sets  like  A  every  time.  Thus,  Greedy-II  would  produce  {A,B,C}.  It  might  seem  like  one 
could  tweak  the  selection  criteria  for  the  winning  node  (line  6)  to  produce  an  optimal  solution  and 
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this  is,  in  fact,  the  case  for  any  given  instance.  However,  in  general,  it  is  always  possible  to  come 
up  with  an  instance  for  which  these  new  criteria  produce  a  sub-optimal  result.  This  is  due  to  the 
fact  that  the  central  characteristic  of  the  algorithm  stays  the  same:  decisions  are  made  with  only 
local  information  and  no  backtracking  is  performed.  Note  that  such  problematic  instances  do  not 
seem  to  cause  6raady-II  to  produce  wildly  incorrect  solutions,  simply  slightly  sub-optimal  ones.  In 
addition,  it  remains  to  be  seen  how  often  actual  hierarchies  contain  such  situations. 


(al.vl)  [aJ.Yil  [*3,v3] 
(a4,v4]  [aS.vS]  [a6,v6] 
{.7.V71 


(a5.vl2]  {a6,vl3]  {a7,vl4] 
(aS.vS]  (aS.vd]  (alO.vlO] 
[all.vll]  (al2,vI2] 


^  —  {[alfVl]  Ia2,v2]  [a3,v3]  [s4,v4)  [a5,v5] 

(aS,v6]  (a7,v7]  [a8,v8]  [a9,v9]  [alO,vlO] 

[all.vll]  Ial2,vl2]} 

Figure  9:  Redundant  link  producing  situation 

Another  situation  that  is  problematic  for  Greedy- II  is  illustrated  in  Figure  9.  The  compiled-out 
set,  Ni  (minus  the  singleton  sets)  for  this  hierarchy  is  listed  in  (9)  where  the  first  set  corresponds 
to  A  and  second  to  B. 


(9)  M  ={{[al,vl]  {a2,v2]  [a3,v3] 

{a4,v4]  [a5,v5]  [a6,v6] 

[a7,v7]} 

{[al,vl]  [a2,v2]  [a3,v3] 

[a4,v4j  [a5,vl2]  [a6,vl3] 

[a7,vl4]  [a8,v8]  [a9,v9] 

(alO.vlO)  [all.vll]  [al2,vl2]}} 

During  the  first  iteration  of  the  main  loop  of  Greedy-II,  A  will  be  selected  to  be  a  superclass  since 
it  will  have  a  score  of  7  whereas  B  will  only  have  a  score  of  6.  In  the  next  round,  however,  B  will  be 
chosen  since  it  will  then  have  a  score  of  2  which  is  higher  than  the  scores  obtained  by  the  singleton 
sets.  At  this  point  all  the  features  of  will  be  covered.  The  new  hierarchy,  created  by  the  insertion, 
is  illustrated  in  Figure  10.  The  link  from  the  object  class  to  A  is  known  as  a  redundant  link  because 
there  is  already  a  path  to  A  through  B.  The  first  thing  to  notice  is  that  the  features  that  end  up 
on  the  object  class  are  the  correct  ones;  no  inconsistencies  exist.  This  is  a  result  of  the  way  clashes 
are  handled.  The  second  thing  to  notice  is  that  it  is  sub-optimal.  The  optimal  insertion  woui:^ 
list  the  same  three  features  but  only  inherit  from  B.  What  has  happened  is  that  the  choice  of  B 
canceled  out  the  benefits  of  A.  Thus,  A  became  a  useless  superclass  when  B  was  added.  In  fact, 
the  link  to  A  could  be  removed  without  changing  the  features  of  the  object  class.  A  post-processing 
algorithm  could  be  designed  to  search  for  and  eliminate  redundant  links.  However,  the  need  for 
such  a  post-processing  stage  still  has  to  be  viewed  as  a  defect  of  Graady-II.  Contrary  to  claim  (3), 
that  each  class  in  a  default  hierarchy  should  be  viewed  as  the  set  of  features  that  can  be  inherited 
from  this  class,  it  seems  like  Figure  9  illustrates  a  case  where  the  inheritance  links  between  classes 


14 


Object 

[•5.V5] 

[«6,v6j 

[*7.v7] 

Figure  10:  The  redundrint  hierarchy 


are  relevant  for  insertion.  If  one  had  such  hierarchical  information  one  could  favor  lower  nodes  in 
a  hierarchy  over  higher  ones  to  eliminate  the  redundant  link  problem.  However,  it  might  also  be 
possible  to  use  weights  on  features  in  the  sets  of  ^  to  encode  the  relevant  hierarchical  structure 
of  the  classes.  These  weights  would  be  added  during  the  compilation  process.  For  example,  the 
weight  of  a  feature  would  start  at  1  in  the  class  where  it  originates.  It  would  then  increase  slightly 
each  time  it  is  passed  down  to  a  lower  class.  Then,  in  lias  8,  instead  of  adding  1  for  each  feature 
covered,  the  weight  would  be  used.  Thus,  lower  classes  would  be  favored  over  higher  ones.  In  any 
case,  redundant  links  are  a  problem  for  6rssdy-‘II  as  it  currently  stands. 

The  second  standard  problem  for  default  inheritance  hierarchies  is  illustrated  in  Figure  11.  The 

Republican  Quaker 

[inil/agr,+]  lniil/«gr,-J 


Nixon 

Figure  11;  Nixon  diamond 

problem  is  that  it  is  unclear  what  features  the  object  class  should  have.  More  specifically,  should 
the  object  class,  Nixon,  inherit  the  feature  [mil/agr,-f]  from  Rqjublican  or  [mil/agr,-]  from  Quidcer? 
This  type  of  situation  is  know  as  a  Nixon  diamond.  The  question  here  is  whether  Orssdy-ZI  ever 
produces  such  situations.  The  answer  is  no.  Because  clashes  are  handled  by  listing  locally  the  correct 
feature  and  because  T  is  filled  out  by  T-features,  it  is  not  possible  for  a  Nixon  diamond  to  result  from 
a  Grssdy-II  insertion.  Consider  inserting  the  Nixon  object  class  with  an  T  such  that  [mil/agr,?]  € 
T  into  a  hierarchy  that  contains  the  Republican  and  Quaker  classes  as  in  Figure  11.  If  either  the 
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Republican  or  Quaker  class  is  chosen  in  lias  6,  then  a  clash  will  result  and  [mil/a^,?]  will  be  listed 
in  the  object  class  for  Nixon.  If  the  other  remaining  class  were  picked  as  a  superclass  later,  the  local 
listing  of  [mil/agr,?]  would  block  any  possible  inconsistency.  If,  on  the  other  hand,  the  for  Nixon 
contained  [mil/agr ,+]  and  the  Republican  and  Quaker  classes  were  chosen  by  the  algorithm,  then 
at  the  point  when  Quaker  was  processed  by  Grssdy-II,  [mil/agr, +]  would  be  put  on  the  clash  list. 
Thus,  it  would  be  listed  locally  in  the  Nixon  object  class.  An  analogous  sequence  of  events  would 
occur  if  T  contained  [mil/agr,-].  Thus,  Nixon  diamonds  are  never  created  by  Grssdy-ZB. 

In  this  section,  I  started  by  showing  that  Grssdy-II  is  efficient.  Next,  we  looked  at  the  basic 
situation  that  causes  Grssdy-II  to  produce  sub-optimal  insertions.  Although  sub-optimal,  these 
insertions  do  not  seem  to  be  not  wildly  off  the  mark.  Then,  we  looked  at  two  standard  problems 
for  default  inheritance  hierarchies:  redundant  links  and  Nixon  diamonds.  Redundant  links  are 
produced  by  Grssdy-II  and  these  insertions  are  sub-optimal.  However,  the  insertions  do  not  result 
in  inconsistencies  and  the  redundant  link  can  be  removed  without  affecting  the  features  of  any  class. 
Nixon  diamonds  are  never  produced  by  Grssdy-II.  Based  on  these  results  it  seems  that  Grssdy-II 
warrants  further  investigation.  Namely,  further  efforts  should  be  made  (a)  to  prove  a  logarithmic 
ratio  bound  and  (b)  to  perform  large  scale  experimentation  with  actual  objects  and  hierarchies. 


6  Conclusion 

The  work  reported  on  here  is  motivated  in  part  by  the  use  of  default  inheritance  hierarchies  by  many 
researchers  to  structure  large  natural  language  lexicons.  An  insertion  system  would  be  helpful  when 
developing  the  hierarchies  of  a  lexicon  system  and  when  trying  to  use  such  hierarchies  to  structure 
large  amounts  of  data  from  outside  sources.  More  generally,  an  insertion  system  is  likely  to  be  useful 
for  any  inheritance  system.  The  results  presented  here  are  relevant  to  any  system  that  employs 
default  inheritance  hierarchies  and  requires  automated  insertion. 

I  started  this  report  by  discussing  informally  the  problem  of  deciding  where  an  object  belongs 
in  a  feature-based  default  inheritance  hierarchy.  Later,  I  formalized  this  insertion  problem  as  IN. 
The  crucial  aspect  of  this  formalization  is  that  the  hierarchy  is  viewed  as  a  set  of  unrelated  sets  of 
features.  In  other  words,  each  class  is  viewed  simply  as  a  set  of  features  that  can  be  inherited  from  it; 
the  inheritance  relations  have  been  compiled  out.  Two  facts  combine  to  support  this  claim;  i)  a  good 
insertion  minimizes  the  space  needed  to  store  the  object  and  thus,  a  class  is  seen  as  an  opportunity 
to  replace  the  listing  of  a  number  of  features  by  a  single  superclass,  ii)  the  structure  of  a  default 
hierarchy  cannot  be  used  to  reduce  the  search  space  of  potential  superclasses.  Because  IN  is  NP- 
complete,  I  designed  an  approximation  algorithm  for  it;  Graedy-II.  I  showed  that  this  algorithm  is 
efficient  and  then  looked  at  some  of  the  possible  problematic  situations  for  the  algorithm.  Although 
more  analysis  and  experimentation  are  needed,  these  preliminary  results  seem  to  show  that  the 
algorithm  warrants  such  efforts.  Thus,  the  main  contributions  of  this  work  are  a  formalization  of 
the  insertion  problem,  an  NP-completeness  proof,  and  a  promising  greedy  algorithm. 
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8  Appendix  A:  Proving  IN</  is  NP-complete 

An  instance  of  the  decision  problem  for  IN,  INj,  is  a  triple:  a  set  a  set  M,  and  an  integer  B.  T 
and  are  as  defined  for  IN  in  section  3.  B  represents  a  upper  limit  for  the  cost  associated  with 
a  solution.  Strictly  speaking  INj  is  a  language.  An  instance  of  INj  is  not  necessarily  in  INj;  it  is 
merely  of  the  right  form.  An  instance  of  INj  is  actually  in  TNd  if  there  exists  a  solution,  as  defined 
for  IN,  that  has  a  cost  lower  than  B.  The  cost  of  a  solution  V  is  defined  in  (5)  and  repeated  in  (10). 

(10)  \1>\+\clash{T,[JV)\ 

For  expository  reasons,  I  will,  from  this  point  on,  use  Tis,  ^fis,  and  Sin  instead  of  J",  Af,  and  B 
when  discussing  IN<(. 

An  instance  of  the  decision  problem  for  SC,  SC^,  is  also  a  triple.  It  is  made  up  of  the  following 
elements. 


/sc  is  the  set  of  integers. 

A/sc  is  the  set  of  subsets  of  /sc  that  can  be  used  to  cover  /sc-  For  every  element  of  /sc,  there 
exists  at  least  one  element  of  Afsc  that  contains  it. 

Bsc  is  an  integer  which  is  the  upper  limit  on  the  size  of  a  proposed  solution. 


A  solution  for  an  instance  of  SC^  is  a  subset  of  A/sc  that  covers  every  element  in  /sc  •  As  with 
INa,  SC4  is  a  language.  An  instance  of  SC4  is  in  SC<|  if  a  solution  Vsc  exists  that  has  a  cost 
smaller  than  Bsc-  The  cost  for  a  set  Vsc  is  its  cardinadity:  |/scl- 


To  prove  that  INrf  is  NP-complete  I  have  to  show  i)  that  a  nondeterministic  polynomial  time 
algorithm  for  INj  exists,  ii)  that  there  is  a  polynomial  time  (in  the  size  of  the  instance)  transforma¬ 
tion  from  SCrf  to  INj.  Let  us  call  this  transformation  from  instances  of  SCj  to  instances  of  INj, 
g  (see  Figure  12).  If  an  instance  x  of  SCj  is  in  SCd  then  g(x)  must  be  in  INj.  If  x  is  not  in  SCj 
then  gfx)  must  not  be  in  INj. 


Instances  of 

SC  ■ 


g(x) 


Instances  of 


IN 


Figure  12:  The  transformation  function  g 

To  prove  (i),  consider  the  following  nondeterministic  algorithm  for  IN^:  pick,  at  random,  a  set 
ViN  from  A//JV  and  compare  its  cost  to  Sjy,  if  it  is  smuler  say  “yes”  otherwise  say  “no”.  This 
algorithm  is  correct  because  if  there  is  a  solution  with  a  cost  lower  than  B/at  then  the  computation 
path  that  corresponds  to  this  solution  will  produce  the  output  “yes” .  The  algorithm  is  polynomial 
in  the  size  of  the  input  because  the  operation  of  picking  a  solution  is  cleuly  polynomial  in  the  size 
of  the  input  and  computing  the  cost  of  this  solution  is  sJso  polynomial.*®  Therefore,  INj  is  in  NP. 

Now  on  to  (ii).  The  transformation  g  contains  a  subroutine  that  maps  each  integer  in  /sc  to 
a  feature  with  that  integer  as  both  the  attribute  and  value  (e.y.  6  -+  [6,6]).  Since  none  of  these 
features  will  have  the  same  attribute  but  different  values,  no  clashes  exist.  Using  this  subroutine,  g 

*^The  Mcond  term  of  the  cost  function  ,  \cla»h{^cLt[J'^CL)\t  may  seem  problematic  because  Tin  is  infinite. 
However,  since  (JPcx,  >■  finite  and  only  a  finite  number  of  features  in  Tjn  are  non-?-features,  it  is  possible  to 
compute  this  term  in  polynomial  time:  run  through  each  feature  in  (JPcz.  and  compare  it  to  the  non-?-features  of 
Tin,  if  >t  clashes  with  a  feature  in  Tin  then  list  the  feature  from  Tin,  if  no  non-?-feature  in  Tjn  has  the  same 
attribute  then  list  the  corresponding  ?-feature,  otherwise  go  on  to  the  next  feature  in  [JPcL- 
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perforins  the  following  mappings  from  sets  of  integers  to  sets  of  features;  Tsc  to  and  Nsc  to 
Min-  Finally,  g  sets  B/at  equal  to  Bsc- 

g  can  be  computed  in  polynomial  time  with  respect  to  the  input  size  since  the  most  complicated 
thing  it  does  is  map  integers  to  pairs  of  integers.  We  now  have  to  show  that  am  x  is  in  SCj  iff  gfzj 
is  in  INj.  To  see  this,  notice  that  for  adl  x  that  are  well-formed  instamces  of  SC,i,  the  sets  in  g^z)  do 
not  contain  any  features  that  clash  with  any  other  features  in  these  sets.  This  means  that  the  cost 
of  a  solution  P/n  for  g{xj  is  \Pin\  (>-c-i  the  clash  term  drop>s  out).  This  is  the  same  cost  function  ats 
for  SCrf.  In  addition,  for  any  x  and  g(x),  Tsc  is  isomorphic  to  Tin  and  Msc  to  Min-  Therefore, 
for  any  solution  Vsc  for  x,  a  solution  Vin  of  the  same  size  exists  for  g(x).  Thus,  a  solution  Vsc 
exists  that  is  smaller  than  Bsc  iff  a  Vin  exists  that  is  smadler  tham  B/jv-  Therefore,  x  is  in  SCj  iff 
g(x)  is  in  IN<|.  Thus,  IN,i  is  NP-complete. 


9  Appendix  B:  Examples  of  INSYST  in  action 


This  appendix  contains  the  input  and  output  of  actual  runs  of  the  INSYST  program.  The  algorithm 
is  similar  to  6r««dy-II  but  not  exactly  the  same. 

Input  theory  (hierarchy): 


IttttltlllttXXtlllXXtlltttXtlttXttl 

X 

X  latiB.dtr 

X  Parpeaa:  Latin  nona  aorphology  (aftar  laaaady  1902) 

X  Aatkor:  Oarald  Qaadar,  Harck  24  19S9 

X  DocnMBtatlaa:  HILP  adatr 
X  Kalatad  Filaa:  lib  datr 

X  Copyright  (c>  Onivoraity  of  Saaaax  1989.  All  rig^ta  roaarvad. 

X 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxtxxx 

X  X  kara  aodifiad  this  fils  slightly  so  that  iasyst  can  parsa  it.  At 
X  this  point  ia  tiaw  IISTST  caaaot  parfoni  iaharitaaea  cloaara  for 
X  all  tha  diffarant  coablaatioas  of  iaharitaaea  nla  tjrpoa  of  DATA. 

X  (Mare  Light) 

X 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxtx 


DO:  <Bo*  siBg>  "<nva  aing>“ 

<aoai  plar>  ■■  “Cara  plBr>'* 

<»oc  aiag>  “  "<BOB  sing>“ 

<»oe  plnr>  ■■  "<Boa  plnr>" 

<acc  siBg>  »•  ''<Bra  siBg>‘' 

<acc  plar>  "<ara  plur>'‘ 

<gaa  siag>  —  "Kgda  siBg>" 

<gaB  plar>  n  '*<gda  plar>" 

<dat  siag>  oo  “<gda  siBg>“ 

<dat  plar>  »  “<gda  plar>“ 

<abl  siag>  n  ''<gda  siBg>“ 

<abl  plar>  »■  ''<gda  plBr>'' 

<BTa  siag>  »  <'‘<root>’') 

<gda  plBr>  ■■  ("<root>"  iis) 
<BTa  plBr>  ''<gan  siBg>" . 

Dl:  <>  —  DO 

<BTa  aiag>  »  ("<root>"  a) 

<acc  aing>  »  ("<root>**  aa) 

<gda  siag>  ■■  (‘'<root>"  aa) 

<abl  Bing>  «  ("<root>'‘  aa) 
<acc  plBr>  ■■  <"<root>"  aaa) 
<gOB  plBr>  ■■  (•'<root>"  aaraa). 

Dlaa:  <>  Dl 

tgaa  plBr>  ■■  ("<root>"  bb) . 

Dlabos:  <>  ■■  Dl 

<gda  plarh  ■■  (''<root>''  aabBs)  . 

D2;  <>  -■  DO 

<acc  siag>  (“<root>'‘  bb) 
<gda  siag>  “  <"<root>"  oo) 
<gOB  siag>  -■  {"<root>"  ii) 
Cgaa  plBr>  o*  ("<root>"  oorBB). 

D9oos:  <>  “  D2 


22 


<«ce  plar>  •»  (“<root>"  ooa) . 

D2vs;  <>  ^  D2ooa 

<mom  »in$>  “  ("<root>"  ua) 
<*oc  aiiig>  D3:<abl  aing>. 

03b:  O  ■>  03 

<BaB  alng>  ■■  <acc  aing> 

<Bva  plar>  01:<Bva  aiBg>. 

03:  <>  »  00 

<faB  aiBg>  “  <"<root>"  la) 
<(da  aiac>  »  03:<gaa  alBg> 
<Bbl  aiBg>  “  ("<root>"  a) 
<B**  plar>  “  ('‘<root>"  aaa) 
<gaB  plar>  »  Oloai 
<gda  plar>  “  <"<root>"  iboa) . 

03aai:  <>  «  03 

<aee  aing>  ”  (•'<root>"  as) . 

03aa:  <>  »  03aai 

<nva  alng>  03tta:<noai  aing>. 

03ia:  <>  »  03aa 

<Bva  aiag>  "  <gaii  aiiig> . 

03inai:  <>  »  03 

<a»a  ainK>  "  D3:<ii»a  plur> 
<gaa  plnr>  “  ('•<root>"  iuai) . 

03aailaa;<>  OSiaa 

<aec  aiBg>  “  D3aa. 

03iai«>:<>  «•  OSaailoai 

<BTa  aiBg>  <gan  aing>. 

03ia:  <>  »  OSialoa 

<acc  aiag>  ”  ("<root>"  la) 
<abl  aiag>  “  DO. 

03aa:  <>  »  OSaalua 

<abl  aiag>  “  ("<root>’'  aa) . 

03b:  <>  «  03 

<ava  plar>  »  03b. 


OSaia:  <>  »  03 

<gaB  plar>  **  03iaa 

<BTa  plBr>  “  ("<rBot>"  la) . 

03Bi:  <>  —  OSala 

<abl  aiBg>  00. 

04;  <>  »  00 

<gaB  aiBg>  “  ("<root>"  aaa) 
<gda  aiagb  ”  ("<root>"  aa) 
<gaB  plar>  ”  ("<root>''  aaai) 
<gda  plar>  »  03. 

D4b:  0  —  04 

<ava  alBg>  —  03aa 
<acc  alBg>  —  03 
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<tet  ■tag>  ■■  <*'<ro«t>"  nil) . 

Bta:  <>  —  M 

<Kvn  •lig>  ■■  <gd*  slng> 

<iivn  plw>  ■■  <“<root>"  n«) . 

M;  <>  ■-  DO 

<ann  •ia||>  (“<ro«t>''  ••) 

<nee  •!■(>  ■■  <"<ro«t>"  ■) 
<Cte  ■!■«>  ■>  <'‘<ro«t>"  nil) 
<nM.  •!■(>  n  D3 
<atm  plw>  ■■  <•«•  ■!■(> 

<§»m  plnr>  ■■  <“<ra«t>*‘  nna) 
<gA*  plnr>  ■■  ('*<root>“  nbnn) . 


EVeatures  of  the  item  to  be  inserted: 


<a«n  plnr>  »  (vir  nna) 
<aoa  alng>  (via) 

<noc  plnr>  ■■  (vir  ana) 
<noc  alBg>  <«ia) 

<«cc  plnr>  (nlr  nna) 
<dat  aiBg>  ■■  (vir  ii) 
<dat  plnr>  ■>  (vir  Ibna) 
<abl  plar>  (vir  ibna) 
<^a  aiag  >  (vir  ia) 
<gdn  alaig  >  ■■  (vir  ii) 
<nva  piur  >  “  (vir  nna) 
<gda  plnr  >  »■  (vir  ibna) 
<fna  plnr  >  (vir  ina) 
<r«ot  >  ••  vir 
<nv«  aing  >  via 
<nce  alng  >  »  via 
<nbi  alng  >  «  vii. 

The  example  run: 

CaaMBd:  load  latin.dtr 

Co—niid;  inafiln  latin.itBl 

laanrtiiig  [Via]  : 

DO  baa  1  pointa 
D1  baa  -3  polata 
Dina  baa  -2  pointa 
Dlabna  baa  -3  pointa 
D2  baa  0  pointa 
D2ooa  baa  -3  pointa 
D3na  baa  -6  polata 
D2n  baa  1  pointa 
D3  baa  8  pointa 
DSoa  baa  8  pointa 
D3na  baa  8  pointa 
DSla  baa  9  pointa 
DSina  baa  10  polata 
D3aaina  baa  10  polata 
DSiaina  baa  11  pointa 
DSla  baa  11  pointa 
DSno  baa  10  pointa 
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Dta  ku  6  points 
DSaio  hns  •  points 
B3ni  kns  8  points 
D4  hns  3  points 
Mb  hns  0  points 
Mn  hns  3  points 
06  kns  0  points 
88888  Tho  sinnsr  is: 

DSisioi: 

nacoapilod  pnir  list  for  DSisina: 

<>  -■  D3oBina;<> 

tarn  siag  >  D3isina:<csn  siag> 
coapilod  pnir  list: 


<noB 

nlng 

>  ■■  •!• 

<aoB 

plnr 

>  ■■  nlr 

<roc 

sing 

>  — 

<TOC 

plnr 

>  «■  »lr 

<nce 

plnr 

>  ■■  *ir 

<4nt 

sing 

>  ■■  sir 

ii 

<dnt 

plnr 

>  ■■  Tir 

Ibtis 

<nbl 

plnr 

>  ■■  rlr 

ibns 

<gon 

sing 

>  ■■  »lr 

is 

<8dn 

oing 

>  “  tir 

ii 

<nbl 

sing 

>  ■■  »lr 

s 

<nvn 

plnr 

>  “  »lr 

sss 

<8dn 

plnr 

>  *■  rlr 

ibus 

<8on 

plnr 

>  —  *lr 

ion 

<ncc 

sing 

>  “  *lr 

•a 

<nTn 

sing 

non  alng 

>  M 

«!• 

<nvn 

sing 

noa  plnr 

>  M 

vir  ••• 

<aTn 

•Ing 

voc  alng 

>  Mi 

ria 

<nvn 

•ing 

roe  plnr 

>  a» 

vir  SOS 

<nrn 

•ing 

nee  plnr 

>  mm 

vlr  ••• 

<ntn 

sing 

dnt  aing 

>  n 

vir  ii 

<nTn 

•ing 

dnt  plnr 

>  ■■ 

Tir  ibns 

<arn 

sing 

nbl  plnr 

>  *■ 

Tir  ibns 

<ntn 

sing 

gon  alng 

> 

Tir  !• 

<nvt 

•ing 

gdn  aing 

>  ■■ 

Tir  ii 

tnrn 

•ing 

nbl  aiag 

>  Ml 

Tir  0 

<nTn 

•ing 

nrn  plnr 

>  mm 

Tir  ••• 

<nrt 

•lag 

gds  plnr 

>  ■■ 

Tir  Ibna 

<nTn 

•ing 

gon  plnr 

>  ■■ 

Tir  Ina 

<nvn 

•ing 

see  ring 

>  ■■ 

Tir  OB 

00  hns  -3  points 

01  hns  -3  points 

Dina  hns  -3  points 
Dlnbns  kns  -3  points 
03  hns  -3  points 
D3oos  hns  -3  points 
03ns  hns  -3  points 
D3n  hns  -3  points 
03  hns  -3  points 
03oa  hns  -3  points 
D3ns  hns  -3  points 
03is  hns  -3  points 
OOina  hns  -3  points 
03oainB  hns  -3  points 
03isiaB  hns  0  points 
D3ia  has  -3  points 
03oa  hns  -3  points 
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Mk  kM  -3 
B3Mi«  fcM  -3  p«iBta 
nSai  kM  “3  palata 
M  kaa  -3  poiata 
IMai  kaa  -3  paiata 
IMa  kaa  -3  paiata 
OS  kaa  -3  paiata 

Flaal  aatpat 


tia: 

<>  «■  031ala«;<> 
<raat  >  •»  *!» 

<atra  alag  >  ■»  tia 
<aec  aiac  >  "•  *!■ 
<akl  ala*  >  til 
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Input  theory  (hierarchy): 


X 

X  Pilo; 

aorbocro.dtr 

X  Parposo: 

Boaa  iafloctioB  la  Sorbs  Croat 

X  Aathor: 

Qorald  Oaadar,  March  24  1969 

X  Docaaaatation: 

mP  sdatr 

X  lolatod  Filos: 

lib  datr 

X  Copyright  (c)  Uaiaorsity  of  Sassax  1969.  All  rights  rosoraod. 

X 

X  Mt  to  bo  tokoa  sorlooaly  ••  •  llngaiotic  anolyolo: 
X  it  Igaoroa  rolovont  otroso  factors 


X  I  hovo  Modif lo4  this  fils  slightly  so  that  iasyst  caa  paras  it .  At 
X  this  point  ia  tiao  IISTST  eanaot  porfon  iahoritaaco  closnro  for 
X  all  tha  difforoat  coabiaatioas  of  iahoritaaco  ralo  typos  of  DATh. 

X  (Marc  Light) 

X 

xxxxxxxxxxxxxxxxxxxxxxxxxx 


Pnoaa;  <noa 
<goa  sing>  • 
<dat  siBg>  • 
<acc  siag>  • 
<loc  siag>  • 
<loc  plar>  > 
<iBS  siag>  • 
<aoa  plar>  > 
<goa  plar>  > 
<dat  plar>  > 
<acc  plar>  ■ 
<iaa  pl«r>  ■ 
<goBdor>  » 


siBg>  ■■  "<root>" 
*  “<aoa  plar>'' 
Caea  plar> 

<aoa  siBg> 

“  "<dat  siag>‘' 

“  "<dat  plnr>" 

I"  <"<root>"  oa) 

■  <"<root>"  i) 

«  "<goa  siag>'‘ 

-  <"<root>"  iaa) 
>•  "<aaa  plnr>" 

-  "<loc  plar>" 
f oaiaiaa . 


Faoaa:  <>  ■ 
<Boa  siag> 
<acc  siag> 
<Boa  plar> 
<goa  plar> 
<dat  plar> 


Paoan 

-  (•'<root>“  a) 

■  ("<root>"  a) 

■  ("<root>"  a) 

■  ("<root>"  a) 

-  ('‘<roet>"  aai 


Naona:  <>  ■ 
<goa  siBg> 
<dat  sing> 
<acc  plBr> 
<goador>  » 


•  Paoan 

™  <"<root>"  a) 
«■  <"<root>"  a) 
••  ("<root>"  o) 
aascaliaa. 
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laftintM  of  tlie  item  to  be  insoied 


<m«i  siaf  >  ■■  kMt 

<fMI  >  ■■  (kMt  1) 

<4«t  ai«f  >  ■■  (kMt  1) 

<Me  aiag  >  ■■  kMt 

<lac  aiag  >  ■■  (keat  1) 
<lae  plar  >  ■■  (koat  1m) 
<aM  plar  >  (koat  1) 
<fM  plot  >  ■■  (koat  1) 
<4at  plmr  >  ■■  (koat  1m) 
<aec  plar  >  ■■  (koat  1) 
<1m  plar  >  ■■  (koat  1m) 
tfoador  >  ■■  foMlaiao 
<rMt  >  M  koat 
<laa  ala(  >  ■■  koakea 

<MMlBg  >  ■■  bOBO. 

The  example  run: 


Caawaad;  load  aorbocro.dtr 

CtMoad:  iMfllo  aorboero .  Ital 

laaortlas  Cloat]  ; 

Paoaa  hM  11  polata 
Faoaa  kM  1  polata 
Naoaa  kM  3  polata 
MtW  Tho  alaaor  la: 

Paoaa: 

aacoi^llod  pair  Hat  for  Paoaa: 

<aoai  alag  >  ■■  ''<root>" 

<goa  aiag  >  ■■  "<aooi  plur>" 

<dat  alag  >  PBoaa:<aoai  plar> 
<acc  alag  >  ■■  Paoaa ;<aoB  aiag> 
<loc  alag  >  ''<dat  aiag>" 

<loc  plar  >  <»  "tdat  plar>" 
<laa  alag  >  (''<reot>''  o«) 

<aoai  plar  >  ■■  ("<root>"  1) 

<goa  plar  >  ”  "<goa  aiag>" 
<dat  plar  >  (”<root>"  Im) 

<acc  plar  >  ”  "taoB  plur>" 
<laa  plar  >  “  "tloc  plur>" 
<gMdor  >  ■■  f  oelalno 

coflt^Uod  pair  Hat : 

<aoai  alag  >  ”  koat 
<goa  alag  >  ■■  koat  1 
<dat  alag  >  koat  1 
<acc  alag  >  “  koat 
<loc  alag  >  »  koat  1 
<loc  plar  >  »  koat  iaa 
<lM  alag  >  koat  oa 
<aoai  plar  >  koat  1 
<gM  plar  >  ■■  koat  1 
<dat  plar  >  ■■  koat  Im 
<aee  plar  >  ■■  koat  1 
<laa  plar  >  koat  Im 
tgoador  >  M  foalalao 


Pmw  kw  0  polata 
Ikma  kas  -1  poiata 
Raaaa  kaa  -1  poiata 

riaal  oatpat 


O  ■■  Paoaa:<> 

<root  >  ■■  koat 
<iaa  aiag  >  ■■  keahca 
taaaalag  >  ■■  boaa 


Eoat: 


